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ABSTRACT 

Transcription is initiated when RNA polymerase 
recognizes the duplex promoter DNA in the closed 
complex. Due to its transient nature, the closed 
complex has not been well characterized. How the 
initial promoter recognition occurs may offer 
important clues to regulation of transcription initi- 
ation. In this article, we have carried out single-base 
pair substitution experiments on two Escherichia coli 
promoters belonging to two different classes, the 
-35 and the extended -10, under conditions which 
stabilize the closed complex. Single-base pair sub- 
stitution experiments indicate modest base-specific 
effects on the stability of the closed complex of both 
promoters. Mutations of base pairs in the -10 region 
affect the closed complexes of two promoters differ- 
ently, suggesting different modes of interaction of 
the RNA polymerase and the promoter in the two 
closed complexes. Two residues on <r 70 which have 
been suggested to play important role in promoter 
recognition, Q437 and R436, were mutated and 
found to have different effects on the closed- 
complex stability. DNA circular dichroism (CD) and 
FRET suggest that the promoter DNA in the closed 
complex is distorted. Modeling suggests two 
different orientations of the recognition helix of the 
RNA polymerase in the closed complex. We propose 
that the RNA polymerase recognizes the sequence 
dependent conformation of the promoter DNA in 
the closed complex. 

INTRODUCTION 

The first step in the transcription process is the formation 
of the closed complex in which the promoter DNA is 



recognized in the double-stranded form (1). The recogni- 
tion of the promoter DNA in the duplex state is followed 
by a series of steps, leading to the separation of strands, 
the open-complex formation and the initiation of 
phosphodiester bond formation (2). Several base pairs 
have been found to be important for the overall transcrip- 
tion initiation process (3). However, it is often not clear at 
which step(s) these base pairs exert their effect. One im- 
portant goal is to identify the kinetic steps and determine 
the role of each base pair in each step (2) leading to a 
structural understanding of the open-complex formation 
process. Roles of some of the bases, such as — 11A have 
been identified (4). The closed complex has proven to be 
difficult to study at room temperature or above, due to its 
transient nature. In a study of X-P K by deHaseth and co- 
workers (5), a quadruple mutant of the a 70 -holoenzyme 
was used to stabilize the closed complex and partial single- 
base pair substitution experiments were performed to 
identify base pairs important for the closed-complex for- 
mation. They observed weak base-specific effects on the 
stability of the closed complex. 

If such weak base-specific effects are the rule in the 
closed-complex recognition, then a more quantitative 
technique to study the contribution of each base pair to 
the binding energy may yield more accurate results. 
Fluorescence anisotropy has been widely used to obtain 
quantitative equilibrium binding data (6). It has been 
established that at 4°C RNA polymerase forms a stable 
closed complex (7). We have previously used fluorescence 
anisotropy at 4°C to obtain quantitative binding iso- 
therms of the RNA polymerase and the duplex 
promoter DNA (8). Using the same methodology, in 
this article, we have studied the effect of single-base pair 
substitution on closed complexes of two different classes 
of promoters. Both the promoters exhibit weak base sub- 
stitution effects in the —10 region, but the role of individ- 
ual base pairs are different in two promoters. We thus 
conclude that the mode of recognition of duplex DNA 
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in individual promoters are different, but involves indirect 
readout. 



MATERIALS AND METHODS 

Materials 

RNA polymerase holo-enzyme and core-enzyme 
were purchased from Epicenter. Sephadex G-50, Ni-NTA 
sepharose and pre-packed FPLC column MonoQ were 
from GE Healthcare. Netropsin dihydrochloride, BSA 
and DTT were from Sigma Chemical Company 
(St Louis, MO, USA). MES was from JT Baker. Oregon 
green isothiocyanate 488, succinimidyl ester of 5(6)-Alexa 
Fluor 488 carboxylic acid and 5(6)-Alexa Fluor 594 
carboxylic acid were from Molecular Probes, Invitrogen. 
DNA grade acetonitrile, anhydrous acetonitrile, dichloro- 
methane, trichloroacetic acid/dichloromethane, tetrazole/ 
acetonitrile, 1-methylimidazole/tetrahydrofuran, acetic 
anhydride/pyridine/tetrahydrofuran, 0.02 M iodine/water/ 
pyridine/tetrahydrofuran, dAbz, T, dGiBu and dCbz 
phosphoramidite and all starting columns with immo- 
bilized nucleosides were purchased from Applied 
Biosystem. 5'TFA-amino C6 modifier was from 
Biosearch Technologies. Some 5'-C6-amino-linked oligo- 
nucleotides were synthesized in-house by a 3400 DNA syn- 
thesizer (Applied Biosystems) and some others were 
purchased from Trilink BioTechnologies. 

Labeling of oligonucleotides 

After purification by MONO-Q, oligonucleotides were 
labeled in 100 ul solution containing 1 M sodium carbon- 
ate/bicarbonate buffer (pH 9.0): DMF: water = 5:2:3. 
Reaction was carried out for 20 h at 25°C with a dye to 
oligonucleotide ratio of 20:1. After incubation, the 
reaction mixture was loaded onto a Sephadex G-50 (GE 
Healthcare) column and eluted and dialyzed against H 2 0. 

Synthesis and purification of oligonucleotides 

The complementary oligonucleotides were annealed in 
10 mM Tris-HCl buffer, pH 7.4 containing 100 mM 
NaCl and 1 mM EDTA by placing the tube containing 
the mixed oligonucleotides in water at 95°C for lOmin 
and cooling at room temperature without any disturbance. 
The duplex oligonucleotides were then purified by 
MONO-Q column in FPLC using 0-1 M KC1 gradient 
in lx PBS (5 mM potassium phosphate buffer, pH 7.4 
containing 137 mM NaCl and 2.7 mM KC1) and dialyzed 
against H 2 0. 

Fluorescence anisotropy 

Fluorescence anisotropy measurements were performed at 
0°C and 4°C by using Quantamaster 6 (PTI) T-geometry 
fluorometer. The titrations were carried out in 50 mM 
MES buffer, pH 6.4 containing 0.2 M NaCl, 10 mM 
MgCl 2 , lOOug/ml BSA, 1 mM DTT, 1 mM EDTA and 
10% glycerol. The temperature was maintained by 
circulating water bath and water vapor was purged from 
the chamber by a steady stream of dry nitrogen gas. 
Fluorescence anisotropy was measured with excitation at 
490 nm and emission at 526 nm using bandwidths of 5 nm. 



The DNA concentrations were 1 nM for the galPl- 
promoter and 2nM for the P R -promoter. When higher 
DNA concentrations are used, it is explicitly stated in 
the figure legend. After mixing with RNA polymerase, it 
was incubated for 3min before fluorescence anisotropy 
measurements. Each point was measured five to six 
times and an average anisotropy value was obtained and 
fitted to a single-site binding equation. Some selective 
binding isotherms are shown in Supplementary Figure S3. 
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where A obs is the observed anisotropy, A { is the anisotropy 
of the protein-DNA complex, K d is the apparent dissoci- 
ation constant, A\ is the anisotropy of the free template, 
[R] t is the total RNA polymerase concentration and [D] is 
the total template concentration. 

Correction of the observed dissociation constant for 
end-binding 

For an oligonucleotide duplex, D, we assume that there is 
only two binding modes. These are promoter binding and 
end binding. If RNA polymerase is represented by R the 
following multiple equilibrium would represent the RNA 
polymerase binding to the oligonucleotide. We also 
assume that under the conditions studied, the DNA is 
largely singly ligated (that is one ligand is bound). This 
is reasonable as in most of the titrations the RNA poly- 
merase concentrations are not very high compared to the 
lowest dissociation constants. 

D+R ======= DR (1) 

D+R ======= DR 2 (2) 

D+R=======DR 3 (3) 

where K represents the promoter binding constant, K e rep- 
resents the end binding constants and Equations (2) and 
(3) represent binding to two ends of the oligonucleotide. 

[DR] = K[D][R] 
[DR 2 ] = K e [D][R] 
[DR 3 ] = K e [D][K\ 
[D] T = [D]+[DR]+[DR 2 ]+[DRi] (4) 
or [D] T = [D]+K[D][R]+K e [D][R]+K e [D][R] 
or [D] T = [D](\+K[R]+2K e [R]) 
or [D] = [D] T /(l+K[R]+2K e [R]) 

Fractional saturation Y may be written as 

Y=([D] T -[D])/([D] r or Y 
= 1 - l/(l+K[R]+2K e [R]) (5) 
= K\R]+2K e [R]/(l+K[R]+2K e [R]) 
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The observed binding isotherm was fitted to a single-site 
binding equation to extract an apparent binding constant 

D+R=~====DR (6) 

IDR] = K\D][R] (7) 

[D] T = [D]+[DR] (8) 

or [D] T = [D]+K, PP [D][R] (9) 

or [D] = [D] r /(l+K app [R]) 

or [DR] = [D] T -[D] = [D] T -[D] T /{l+K app [R]) 
= [D] r {l-l/(l+K app [R])} 

or [DR] = [D] T {K app [R]/(\+K app [R])} 

From these equations, one can obtain a fractional 
saturation (F), 

Y = [DR]/[D] T = ^app[i?]/(l+^app[i?]) (10) 

By comparing Equations (5) and (10) 
We may conclude that 

Kapp = K+2K e 

Fluorescence resonance energy transfer 

5'-C6 amiiio-liiik oligonucleotides were purchased from 
Trilink BioTechnologies and reverse phase HPLC purified 
before use. The oligonucleotides were reacted with a 
succinimidyl ester of 5(6)-Alexa Fluor 488 carboxylic acid 
(Alexa 488) or 5(6)-Alexa Fluor 594 carboxylic acid (Alexa 
594). The coupling reaction was performed by mixing the 
DNA strand with a 10-fold excess of the dye molecule in 
200 mM sodium carbonate buffer (pH 9.0) and leaving the 
mixture standing in the dark overnight at room tempera- 
ture. After the unreacted dye had been removed by gel fil- 
tration, the labeled strand was purified by FPLC on a 
MonoQ column using 0-1 M KC1 gradient in 1 x PBS. 
After FPLC, salt was removed by gel filtration. 
Concentrations of the labeled DNA strands and the 
labeling ratio (dye/DNA) were determined spectrophoto- 
metrically using extinction coefficients of 71000M/cm at 
495 nm for Alexa 488 and 73000M/cm at 590 nm for 
Alexa 594. For DNA concentration measurement, the 
dye absorbance at 260 nm was corrected using the following 
formula: A real = A obs - (A Xmax x CF). ^ max is the peak ab- 
sorbance value of the coupled dye and CF is the correction 
factor (0.27 for Alexa 488 and 0.40 for Alexa 594). In this 
manner, we were able to confirm that each DNA strand 
contained only one dye molecule. Double-stranded DNA 
was made by mixing the appropriate complementary 
strands in an equimolar ratio (for the doubly dye-labeled 
DNA), heating to 95°C H 2 0 for lOmin and cooling slowly 
to room temperature over a period of 12-14 h. 

Steady-state fluorescence spectra were recorded at 4°C 
by using Quantamaster 6 (PTI) T-geometry fluorimeter. 
The bindings were carried out in 50mM MES buffer, 
pH 6.4 containing 0.2 M NaCl, 10 mM MgCl 2 , 100(.ig/ml 



BSA, ImM DTT, 1 mM EDTA and 10% glycerol. 
The fluorescence spectra of oligonucleotides and the oligo- 
nucleotide complexes were taken at oligonucleotide con- 
centration of 4 nM and RNA polymerase concentration of 
40nM, respectively. The buffer-only spectrum was sub- 
tracted from the oligonucleotide spectra and the protein- 
only spectrum was subtracted from the complex spectra. 
Fluorescence excitation spectra were recorded from 480 
to 610 nm at the emission maximum set to 620 nm with 
slits set at 5 nm for both excitation and emission. 

Circular dichroism spectroscopy 

CD measurements were done on a JASCO (Tokyo, Japan) 
J720 spectropolarimeter using a 1-cm pathlength quartz 
cuvette. The scan speed was 120nm/min. Five scans were 
signal averaged to increase the signal to noise ratio. The 
CD spectra of oligonucleotides and the oligonucleotide 
complexes were taken at oligonucleotide concentration of 
200 nM and RNA polymerase concentration of 400 and 
600 nM, respectively. The buffer-only spectrum was sub- 
tracted from the oligonucleotide spectra and the protein- 
only spectrum was subtracted from the complex spectra. 
These experiments were carried out in 50 mM MES buffer, 
pH 6.4 containing 0.2 M NaCl, lOmM MgCl 2 , lOOug/ml 
BSA, 1 mM DTT, 1 mM EDTA and 10% glycerol at 4°C. 

Site-directed mutagenesis, cloning and purification 
of the a 70 mutants 

The mutant proteins were purified following the protocol 
of Jin and co-workers (9) with certain modifications. 
Site-directed mutagenesis were done at Arginine 436 and 
Glutamine 437 of E. coli a 70 to generate the alanine 
mutants using the His-tagged wild-type plasmid by 
PCR-based site-directed mutagenesis kit (Agilent 
Technologies). Both the clones were done in pET28a. 
Sequencing of the whole gene showed that the plasmids 
contain the desired mutations only. 

The plasmids encoding the mutant proteins were trans- 
formed in BL21 (DE3) competent cells. The cells were 
grown overnight in 100 ml LB medium with 2% glucose 
and lOOug/ml kanamycin at 37°C. The next day, the 
entire 100-ml culture was diluted 1:10 in fresh LB 
medium (1000 ml) with 2% glucose and 100(ig/ml kana- 
mycin and was grown at 37°C until ^4 60 o~0.6. IPTG was 
added to a final concentration of 1 mM. The culture was 
grown at 30°C for an additional 4h and then harvested. 

The cell pellet was resuspended in 40 ml buffer B con- 
taining 20 mM Tris-HCl buffer, pH 8.0, 500 mM NaCl, 
20mM imidazole, ImM PMSF, ImM P-Me, 10% 
glycerol and 0.1% Triton X and then disrupted by sonic- 
ation. The cell lysate was centrifuged for 40 min at 23 000 g 
at 4°C. The supernatant was allowed to bind for 1 h with 
1 ml Ni-sepharose pre-equilibrated with buffer B. It was 
loaded onto a column and then washed subsequently with 
20 ml buffer B and 10 ml buffer B containing 80 mM imid- 
azole. Nickel bound proteins were eluted with 80-300 mM 
imidazole gradient in buffer B. The fractions containing 
mutant a 0 were initially dialyzed against 50 mM 
Tris-HCl buffer, pH 8.0 containing 50 mM NaCl, 10% 
glycerol, O.lmM EDTA, ImM (3-Me and 0.1% 



Nucleic Acids Research, 2013, Vol. 41, No. 1 369 



Triton X. It was finally dialyzed against buffer containing 
50mM Tris-HCl buffer, pH 8.0 containing 50mM NaCl, 
50% glycerol, 0.1 mM EDTA, 1 mM DTT and then stored 
at -70°C. Both R436A-a 70 and Q437A-ct™ proteins were 
purified according to the above protocol. 

RNA polymerase holo-enzyme containing mutants a 70 
were then reconstituted by mixing core RNA polymerase 
and the mutant a 70 in ratio 1:1.3 in protein storage buffer 
(50 mM Tris-HCl buffer, pH 8.0 containing 50 mM NaCl, 
50% glycerol, 0.1 mM EDTA, 1 mM DTT). After mixing, 
it was incubated at 25°C for 30min and finally stored 
at -20°C. 

Modeling 

The model of a 70 subunit of E. coli RNA polymerase was 
docked to the A-Pr and gal promoter models with expert 
interface of HADDOCK 2.1. HADDOCK (10,11) is an 
information-driven flexible docking tool for modeling of 
biomolecular complexes. The starting structure for 
docking was the homology-modeled structure of E. coli 
a 70 RNA polymerase. The homology model was built 
for residues 359-613 using RNA polymerase holoenzyme 
of Thermus thermophilus (PDB id: 1IW7) (12) as template, 
using Insight II (Accelrys Inc., San Diego, CA, USA). The 
1-P R and gal promoters were modeled using 3D-DART 
(13). The bases from —40 to —1 were modeled for X-Pr 
and —30 to —1 for gal promoters. For introduction of 
bending in the DNA models, twist, roll and slide base 
pair step parameters for the TATA bases (corresponding 
to TATG in gal and GATA in A-P R promoter) and the 
base —26 were changed within conformational space 
allowed for these bases (14,15). 

The residues 589, 433, 434 and 437 of the sigma subunit 
of the RNA polymerase were defined as active residues for 
docking with 1-P R promoter. The bases —12, —10, —11 
from —10 region and —35 from —35 region were defined 
as active residues for X-P K promoter. Distance restraints 
were provided between bases —11, —12 and —35, and 
residues 433, 437 and 589, respectively. For docking with 
gal promoter, residues 433, 434 and 458 of RNA polymer- 
ase were considered as active residues whereas bases —10, 
— 11 and —14 were considered active residues for the gal 
promoter. Both the DNA molecules were considered 
as fully flexible. Docking was performed in solvated 
mode with water as solvent. The docking protocol 
consists of three stages, a rigid-body energy minimization, 
a semi-flexible refinement in torsion angle space and a final 
refinement in explicit solvent. In expert interface, during 
the rigid body energy minimization, 1000 structures were 
calculated and the best 200 solutions, based on the inter- 
molecular energy, were used for the semi-flexible simulated 
annealing followed by an explicit water refinement. 

RESULTS AND DISCUSSION 

Transient molecular associations, like the closed complex, 
are difficult to analyze due to their short-lived nature. One 
of the best ways to study a transient species is to stabilize it 
under certain conditions. It has been observed that at 4°C, 
the closed complex does not proceed to the open complex 



and hence it can be studied under equilibrium conditions 
(16). One-time tested way to explore the important inter- 
actions in protein-nucleic acid complexes is through 
systematic base and amino acid substitutions (17). 
Single-base pair substitution effects are most effectively 
surveyed if the binding assays are done at equilibrium 
and quantitatively. Fluorescence anisotropy is an estab- 
lished technique to study protein-DNA interactions at 
equilibrium. This assay requires an end-labeled oligo- 
nucleotide duplex and hence interference from RNA poly- 
merase end-binding can be significant under certain 
conditions (18). In previous studies, conditions were 
reported in which the end-binding is substantially 
reduced, making the quantitative assay of promoter 
binding in the closed complex possible (8). Thus, we 
have chosen fluorescence anisotropy assay at 4°C to 
obtain binding isotherms of RNA polymerase to 
promoter DNA in the closed-complex state. 

Base-c interactions in the )u-P R promoter in the 
closed complex 

Since many of the previous studies on the closed complex 
were performed on X-P K promoter, we chose this promoter 
to study the —35 class of promoters (Figure 1A). 
Figure IB shows the binding isotherm of RNA polymer- 
ase and wild-type A-P R promoter (contained in an oligo- 
nucleotide duplex: named PR) at 4°C using fluorescence 
anisotropy. As a control, a binding isotherm of oligo 
dA.dT (dAT-20) and RNA polymerase is also shown 
(Figure 1C). The dissociation constants derived from 
promoter binding isotherms are considered as apparent 
dissociation constants (-K d app ) as the binding isotherm is 
a result of both promoter (stronger) and end (weaker) 
binding. The dAT-20 binding isotherm is substantially 
weaker than the promoter binding and likely results 
from the end binding of RNA polymerase. The 
dAT-20: RNA polymerase K d was used to correct the 
effect of competing end binding reactions in the 
promoter binding isotherms to obtain the actual dissoci- 
ation constant (K d act ). The details of the correction pro- 
cedure is given in 'Materials and Methods' section. Figure 
ID shows the AAG values of single-base substitution 
effects in the —10 region for the wild-type X-P R 
promoter. The base substitution strategy used was to sub- 
stitute non-complementary purines for pyrimidines and 
vice versa, e.g. G to T, A to C, etc. The most striking 
feature of single-base pair substitution effects was that it 
is weak, if any, for all the base pairs studied. Only signifi- 
cant effects in the —10 region in the wild-type template are 
seen for —13 (weakening) and —12 (strengthening upon G 
to T substitution) positions. Strengthening of binding of 
similar magnitude upon G to T substitution at —12 
position was also observed previously (5). To our know- 
ledge, the effect of — 13-bp substitution on the closed 
complex has not been observed before in the A-P R 
promoter. deHaseth and co-workers (5) also observed 
that T7C substitution led to ~2. 4-fold weakening of the 
binding constant in the G12T mutant background. Thus, 
equilibrium binding was also studied in the T7C- 
substituted template in the G12T background. We also 
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Figure 1. Effect of base pair substitution in A-P R promoter. (A) The sequence of A-PR promoter. The red-colored bases are the —35 element and the 
blue-colored bases are the —10 elements. (B) Binding isotherm of E. coli RNA polymerase with wild-type A-P R promoter using fluorescence 
anisotropy at 4°C. (C) Binding isotherm of dAT-20 oligomer duplex under identical conditions. (D) AAG values of single-base pair mutants of 
A.-P R promoter. 



observed ~2-fold weakening of RNA polymerase binding 
under closed-complex conditions. Almost identical results 
obtained with two different techniques strongly suggest 
that there is only weak base specificity in the closed 
complex of 1-Pr. 

Base pairs that stabilize the closed complex at galPl are 
different from ^-Pr 

Promoters in E. coli has several elements that influence 
their activity. They are —10 region, —35 region, UP 



element, the TGx sequence, the discriminator base and 
the spacer between the —10 and —35 regions (19). In one 
major class of promoter, the —35 element plays a crucial 
role (—35 class). In another major class of promoters —10 
and TGx sequence at —14 plays the crucial role (extended 
— 10) (20). The strength of —35 class promoters are 
strongly dependent on the nucleotide sequence present 
in the —35 region whereas the activity of extended —10 
promoters does not depend on nucleotide sequences in the 
—35 region; instead they are strongly dependent on the 
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nature of the nucleotide present at the —14 position. The 
1-P R promoter belongs to the —35 class. Hence we have 
attempted to identify the base pairs that are recognized in 
the closed complex of a promoter belonging to the 
extended —10 class. We chose the galPl promoter of 
E. coli as it is one of the most intensively studied pro- 
moters of this class (21). The galPl promoter however, 
partially overlaps with another promoter, the galP2 
(Figure 2A). The galP2 transcription start site is 5 bp 
upstream of the start site of the galPl (22). Initially, 
four mutations at —14, —13, —12 and —11 of the galP2 
with respect to the P2 transcription start site (+1) was 
chosen as the template to reduce competition from RNA 
polymerase binding at galP2. Single-base pair substitution 
experiments with this template (called P2~) indicate that 
like the A.-P R , base substitutions effects are also weak 
when compared to base substitution effects in lambda 
operator sites; however, unlike the A-P R , 3 bp (—14, —10 
and —8) contribute significant-specific binding energy to 
the closed complex (Figure 2B). Interestingly no contribu- 
tion is made by —12 and —7 bp, ones that contribute sig- 
nificant binding energy to the ^-Pr promoter (—7 in G12T 
background). Among these 3 bp, the —10 bp appears to 
contribute the most base-specific binding energy, 
followed by the —8. Contribution of the —14 bp appears 
to be the weakest. The galPl triple mutant (—14, —10 and 
—8) in the P2~ background also binds weakly when 
compared to the wild-type galPl. Similar results were 
obtained in another mutant background which contained 
mutations in both galP2 and another weak promoter 
galP3, present in the oligonucleotide sequence used (see 
later). In a previous article, Burr et al. (21) have concluded 
that the removal of the TG elements results in significant 
lowering of the isomerization constant, k t - but not the K B . 
This is fully consistent with our result that the —14 bp 
contributes very little specific binding energy to the 
closed complex formation. Interestingly, C to A substitu- 
tion in the — 1 3 position results in tightening of binding, 
suggesting that the presence of a purine in this position 
contributes some specific binding energy to the closed 
complex. Previous bioinformatic analysis suggested the 
importance of purines in the —13 position (23). The 
effect of substitution in the —13 position is opposite in 
the A-Pr promoter, indicating significant difference in 
the structure of the two closed complexes. Despite the 
difference in the contribution of the individual base 
pairs in the —10 region to the specific binding energy, 
the contribution of base-specific binding energy to the 
overall binding energy is similar in the closed complex in 
a different class of promoter as well. We have also per- 
formed binding experiments for selected mutants at 0°C, 
as extended —10 promoters form open complexes at lower 
temperatures than the —35 class promoters (16). Binding 
isotherms determined at 0°C gave almost identical results 
as those at 4°C (Figure 2C). 

Since the galP2 also belongs to the extended —10 class, 
it may be argued that the four mutations introduced in the 
galP2~ template may not be sufficient to significantly 
weaken polymerase binding to the galP2 at the closed- 
complex level as two important bases (the —10 and the 
—8) that contribute significant specific binding energy in 



the galPl are left out (8). In addition, the oligonucleotide 
sequence used here for binding experiments also contains 
a cryptic promoter, the galP3 (24). We thus created 
another template which contains —14. —10 and —8 
mutation with respect to the galP2 as well as the galP3 
(termed P2 = P3~). Sequences of all the oligonucleotides 
are given in the Supplementary Tables SIS 1 1 . Selective 
mutations with respect to the galPl in this template 
(P2 = P3~) indicated same base pairs (particularly —10 
and —8 with respect to galPl) of galPl contributing to 
the base-specific binding energy in the closed complex 
(Figure 2D). This suggests that the results obtained in ex- 
periments described above are not significantly impacted 
by the presence of galP2 and galP3 promoters. Since 
extended —10 class promoters elements are confined 
within the —1 to —20 bp, and the specific binding energy 
is contributed by —14, —10 and —8 bp, we reasoned that a 
shorter DNA fragment from +1 to —20 may bind well 
to the RNA polymerase. A 21 -bp duplex oligonucleotide 
(Pl-21) containing the sequence of +1 to —20 binds 
to RNA polymerase with high affinity 
(K d act = 4.3 ± 1.6nM) containing the galP2" mutations 
(—14 through —11). When the three mutations (—14, 

— 10 and —8) with respect to the galPl was introduced in 
this template, it bound with ~ 10-fold lower affinity, 
indicating that this smaller oligonucleotide faithfully 
reproduces the binding characteristics of the larger oligo- 
nucleotides (Supplementary Figure SI). Thus, it is very 
likely that in the galPl, only 3 bp contribute any signifi- 
cant base-specific binding energy to the closed complex. 
The differential sensitivity of the X-Pr and the galPl to 
base substitutions are strongly suggestive of two different 
modes of interaction of the RNA polymerase with two 
different classes of promoters. 

Residues on <t 70 play different roles in GalPl and X,-P R 
closed complexes 

The two different modes of interaction of RNA polymer- 
ase with two different classes of promoter inferred from 
the base substitution experiments suggest that amino acid 
residues on a 70 subunits may also have different roles in 
the two closed complexes. Two such possible residues are 
Q437 and R436. Darst and co-workers (25,26) have pre- 
dicted that Q437 of Ecct 70 may be close to the base pair 

— 12 of —35 class promoters and may even interact with 
the base pair. Gralla and co-workers (27) have used an 
EMSA assay to show that two mutations, R436S and 
Q437S results in abrogation of promoter binding at 4°C 
in the former case and tightening of promoter binding in 
the latter case. We have thus attempted to use fluorescence 
anisotropy assay to explore the roles of the two residues in 
the closed complex of the galPl and the A.-P R promoters. 
Figure 3A shows the binding isotherms of wild-type and 
Q437Aa 70 RNAP with the wild-type A.-P R promoter. The 
A d act of wild-type RNAP:A.-P R promoter is 7.7 ± 0.3 nM. 
In contrast, Ka act derived from the binding isotherm of the 
Q437Aa 70 -holoenzyme is 14.9 nM, which is ~2-fold 
weaker than that of the wild-type RNA polymerase. In a 
similar way the Kj iCt of galPl:Q437Acj 70 RNAP was 
derived from the binding isotherm and was found to be 
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Figure 2. Effect of single-base pair substitution on galPl promoter. (A) Sequence of galPl promoter. The red-colored base is the —14 and the 
blue-colored bases are the —10 elements. (B) AAG values with respect to galPl + P2~P3 + promoter, plotted for all the single-base pair mutants tested 
at 4°C. The sequence at the top shows the P2— mutations shown in green. (C) AAG values of selected mutants at 0°C with respect to galPl + P2~P3 + 
promoter. (D) AAG values of single-base pair mutants in the galP2~P3~ background at 4°C. All the experimental conditions are described 
in 'Materials and Methods' section. The sequence above shows the P2~ and P3~ mutant gal promoter with P2~ mutations in magenta color 
and P3~ mutations are in orange color. 



0.76 nM. This value is almost identical to that of the 
galPl-wtRNAP complex, suggesting that Q437 does not 
contribute specific binding energy in the formation of the 
closed complex with the galPl promoter. The roles of the 
residue 437 is different in the two closed complexes. 
A similar exercise was carried out with R436A mutation 



and the AAG values are shown in Figure 3B. The binding 
defect detected in this case with 1-Pr template is smaller 
than that of the R436S on the same promoter detected 
with EMSA. Gralla and co-workers (27) reported that 
they have detected almost complete abrogation of 
duplex binding upon R436S substitution. In our assays. 
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Figure 3. Effect of er mutations on the binding in the closed complex. (A) Binding isotherms of E. coli RNA polymerase (open circles) and E. coli 
RNA polymerase-CT 70 -Q437A (solid circle) with wild-type A.-P R promoter at 4°C. (B) AAG values of Q437A and R436A mutants for wild-type galPl 
and A-Pr promoters with respect to the wild-type RNA polymerase at 4°C. (C) AAG values of Q437A and R436A mutant RNA polymerases for 
different galPl mutant promoters from the corresponding wild-type template at 4°C. 



we see significant weakening of binding upon R436A 
mutation, but the effect appears to be much smaller 
than that of Gralla and co-workers. This could be due 
to the serine versus alanine effect, but more likely it 
could reflect the difference between two assays. As was 
stressed above, the fluorescence anisotropy method is a 
true equilibrium technique, whereas the EMSA involves 
kinetic separation of two or more species in the gel. Under 
certain conditions, EMSA tends to overestimate binding 
defects (28). We discuss other possibilities later. In order 
to find out if Q437A and R436A mutant RNA polymerase 
is behaving like the wild-type polymerase in promoter rec- 
ognition, we have carried out binding of these mutant 
polymerases with several mutant galPl templates. Figure 
3C shows the AAG values of the mutant templates in 
reference to the corresponding non-mutant template. 
Clearly, the AAG values are qualitatively similar to the 



wild-type polymerase, suggesting that the binding modes 
of the two mutant polymerases are similar to that of the 
wild-type polymerase. We can tentatively conclude that 
the RNA polymerase recognizes two different promoters 
differently in the closed complex. 

Promoter DNA is distorted in the closed complex 

From their study of T7 RNA polymerase and promoter 
interaction, Patel and co-workers (29) have suggested that 
the closed complex may contain bent or distorted DNA. 
Others have also suggested the possibility that the bending 
of the DNA may play crucial role in recognition and 
isomerization of closed complexes (30). Thus, we have at- 
tempted to find out if the DNA in the closed complex is 
distorted. CD spectra of DNA is a sensitive monitor of 
DNA distortion (31). Protein CD spectra in this wave- 
length region is orders of magnitude weaker than the 
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DNA CD spectra and for most purpose can be neglected 
in comparison with that of the DNA (32). Figure 4A 
shows the CD spectra of the galPl promoter DNA (oligo- 
nucleotide duplex PI) in the absence of RNA polymerase 
and in its presence (two concentrations) at 4°C. CD 
spectra of the DNA in the presence of RNA polymerase 
is distinctly different from the spectra of the DNA alone, 
suggesting significant distortion of the promoter DNA in 
the closed complex. To obtain a more quantitative assess- 
ment of the nature of DNA distortion, we have attempted 
to measure FRET from two ends of the promoter DNA 
duplex in the closed complex. The promoter DNA that we 
have used so far contains 63 bp and hence are ~225A 
long. It is difficult to obtain a FRET pair that can 
measure FRET at such a long distance. 

Thus we have used the 21-mer duplex DNA described 
above (PI— 21), containing the region +1 to —20 bp of 
galPl. The two strands of this shorter DNA (containing 
hexyl amine at the 5'-end) are separately labeled with 
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Figure 4. (A) Circular dichroism spectra of galPl + P2~P3 + DNA (oligo 
duplex PI) with and without E. coli RNA polymerase (black line), 
400 nM RNA polymerase (green line) and 600 nM RNA polymerase 
(blue line) at 4°C. (B) Fluorescence resonance energy transfer 
between two ends of a 21-mer DNA (PI— 21) encompassing the 
extended —10 elements of galPl promoter (+1 to —20; galPl + P2~) at 
4°C. The two spectra shown are excitation spectra in which the peaks 
at 590 nm have been normalized. Donor only spectra and buffer have 
been subtracted from the excitation spectra of donor + acceptor and 
acceptor only, respectively. 



donor (Alexa 488) and acceptor (Alexa 594) FRET pairs 
and annealed. The FRET was measured by comparing the 
excitation spectra of the donor and the acceptor labeled 
promoter DNA (^d+a) with that of a duplex DNA in 
which only the acceptor was labeled (F A ) (Figure 4B). 
Fd+kIFa is a function of the FRET efficiency (33). The 
calculated FRET efficiency of the free DNA was 0.155 
which translates to a distance of 79.6 A. The distance 
between the two ends of the 21-mer DNA should be 
~72 A; if one assumes that the hexyl amine linker stretches 
for another few As, the FRET data are in excellent agree- 
ment with the calculated distance. When, the FRET was 
measured in the closed complex (Figure 4B), the FRET 
efficiency increased to 0.19, possibly indicating somewhat 
closer approach of the two ends. The distortion observed 
in the CD spectra and the change in efficiency seen in the 
FRET experiments argues in favor of a modestly distorted 
DNA in the closed complex. 

Modeling of the closed complex 

Two different modes of interaction of RNA polymerase 
with two promoters belonging to two different classes 
were inferred from the single-base pair substitution 
studies. In order to find out plausible spatial relationships 
of the promoter DNA and the a 70 subunit, we have at- 
tempted to model the closed complex for two different 
classes of promoters by flexible docking program 
HADDOCK. To test whether the RNA polymerase 
binds through the major groove, we have titrated a 
pre-formed galPl:RNA polymerase closed complex at 
4°C with netropsin, a minor groove binding antibiotic 
(34). Supplementary Figure S2 shows the change in an- 
isotropy of the galPl closed complex as a function of 
netropsin concentration. The anisotropy value decreases 
modestly as a function of netropsin concentration but 
quickly levels off. The final anisotropy value is much 
higher than that of the free promoter DNA even at 
200 uM concentration of netropsin. Clearly netropsin is 
not a competitor for the RNA polymerase binding site 
on the DNA. We thus conclude that RNA polymerase 
does not interact with the promoter DNA (—10 
hexamer) in galPl through the minor groove. Similar 
results were obtained for the 1-Pr promoter. We have 
put constraints by inserting known experimental proxim- 
ity relations to constrain many possible structures. The 
details of the procedure and the constraints used are 
given in 'Materials and Methods' section. The two 
complexes are shown in Figure 5A. To model the 
complexes, a bend in the promoter DNA was introduced 
by changing the roll, twist and slide parameters of the TA 
and TG steps in the —10 hexamer. The interacting part of 
the a 70 is docked on to the promoter DNA after defining 
the active residues in the HADDOCK program. Both the 
complexes interact through the major groove of the 
promoter, consistent with the netropsin titration. 
Orientation of the recognition helix (residues 428-445) 
are different in the two complexes, resulting in 
somewhat different interaction pattern of the residue 
437. The details of the interaction is shown in Figure 5B. 
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Figure 5. (A) Docking of the interacting part of rj to the A-P R promoter (left panel) and the galPl promoter (right panel). (B) Enlarged pictures 
showing possible interactions in the respective panels. 



The most striking feature of the results presented in this 
article is that the stability of the closed complex is not 
significantly compromised by any single-base pair substi- 
tution in the —10 region. Only a few-fold effect is seen for 
the —10 and —8 bp in the galPl and —7 and —12 in the 
A-P R promoter. The magnitude of the effect is smaller 
than that of several other DNA binding proteins where 
single-base pair substitution effects are known. In X-Cl 
and A.-Cro, for example, substitutions of bases that form 
direct hydrogen bonds with the protein side chains lead to 
10 2 - to 10 3 -fold change in the dissociation constants 
(17,35). A likely explanation of the weak base substitution 
effect observed in this study is that the RNA polymerase 
reads out the sequence dependent conformation of the 
promoters rather than the base sequence itself. Using dif- 
ferent techniques, similar weak base-specific effect in 
promoter recognition has been noticed in other studies 
before. Gross and co-workers (36) made several constructs 



of sigma factors lacking the inhibitory region 1.1. In sigma 
factors, the region 1.1 prevents free sigma factors from 
binding to promoters. Gross and co-workers had 
observed that A 1.1 sigma factors bind promoters tightly, 
presumably in a closed complex like fashion as sigma 
factors without the core enzyme are unlikely to proceed 
to the open complex. They also observed that these con- 
structs bound non-specific DNA with high affinity as well. 
The observed promoter versus non-specific DNA discrim- 
ination by the A 1.1 sigma factors alone was ~5-fold. The 
discrimination of promoter versus non-specific DNA in 
the closed complex observed in this study is of similar 
magnitude, suggesting that promoter versus non-specific 
DNA discrimination in the closed complex is not very 
large. This is consistent with the hypothesis that RNA 
polymerase recognizes the base sequence dependent con- 
formation of the promoters. Feklistov and Darst (3) also 
suggested that the recognition of the —10 element in the 
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closed complex is complicated. They suggested that speci- 
ficity of recognition of the —10 element is a result of 
coupling of non-specific recognition of the duplex state 
and specific recognition of the flipped bases. Our results 
are consistent with this hypothesis and in physiological 
temperatures, these two steps may indeed be strongly 
coupled resulting in transient nature of the duplex state 
in the closed complex. However, at least in the present 
study, at low temperatures, the duplex state seems to be 
uncoupled from the base-flipped state. This is evident 
from lack of substantial effect of —11 substitutions on 
the dissociation constants. 

Indirect readout of sequence dependent conformation 
of the DNA has been noticed for several protein-DNA 
complexes (37,38). Indirect readout may also be 
accompanied by significant distortion of the DNA (39). 
The DNA CD and FRET studies indicate a modest dis- 
tortion of the DNA. At this stage the exact nature of the 
DNA distortion is not known; however, we note that there 
are more than one TA or TG steps in the promoters (de- 
pending on the promoter sequence) which can easily adopt 
a non-canonical conformation (15). We speculate that the 
binding of the RNA polymerase to a promoter causes the 
duplex DNA to distort. It is interesting to note that two 
relatively major base substitution effect observed here 
(—12 at 1-Pr and —10 at galPl) results in loss of TA or 
TG step. It has been suggested previously that upon 
binding of RNA polymerase to the promoter DNA, 

— 1 1 A base flips out initiating the strand separation. It 
has been proposed that the interaction of flipped out 

— 1 1 A base with aromatic residues in the sigma factors 
stabilize the flipped conformation (3,4,40). The putative 
DNA distortion observed here may also aid the flipping 
out of — 1 1 A by destabilizing the DNA duplex. 
Previously, a hypothesis has been forwarded in which 
the DNA distortion is seen to play a crucial role in 
initiating the strand separation (30). 

A key question for the future is why RNA polymerase 
prefers an indirect readout of the sequence in the closed 
complex? One distinct possibility lies in the fact that 
direct hydrogen bonding with the base pairs may not be 
suitable for subsequent steps in the open-complex forma- 
tion pathway. Structure of the open complex suggests large 
conformational change in the DNA and the protein from a 
putative closed complex form (41). Strong hydrogen bond 
formation may require breakage of these bonds in subse- 
quent steps as well, which may increase the activation 
energies of the steps down-stream. Indirect readout may 
be a compromise between the requirement of the steps 
down-stream and initial recognition of the promoters. 
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